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0.  Introduction 


The  theory  of  Ito  stochastic  differential  equations  has  been  applied  with 
great  success  to  stochastic  nonlinear  filtering  and  control  theory.  It  is  con¬ 
venient  to  begin  with  a  brief  outline  of  the  main  developments  of  nonlinear  fil¬ 
tering  that  concern  us  in  this  paper. 

Let  the  unobserved  signal  process  X  =  (Xu)  be  a  Markov  process  taking  values 
in  F  .  It  is  assumed  that  the  generator  of  X  is  known  or  that  it  satisfies  an 
Ito  stochastic  differential  equation.  The  canonical  model  of  the  observation 
process  is  given  by 

(0.1)  Yt  =  hu(Xu)du  +  Wt  ,  0  <  t  <  T  , 

where  h:  [0,T]  *  F^  ->  F^  is  a  measurable  function  such  that 
(0.2)  Jj  |hu(Xu)|2du  <  »  a.s. 

In  (0.1),  W  =  (W^)  is  a  standard,  d-dimensional  Wiener  process.  Under  very 
general  assumptions  on  the  dependence  between  X  and  Y,  Fujisaki,  Kallianpur  and 
Kunita  derived  in  [10]  a  general  stochastic  differential  equation  for  conditional 

expectations  n^Cf)  =  E[f(Xt)|Ys>  0<s<t]  for  a  class  of  f's  belonging  to  the  do¬ 
main  of  the  infinitesimal  generator  of  X.  Subsequently,  the  problem  of  existence 
and  uniqueness  of  the  solution  of  the  stochastic  differential  equation  satisfied 
by  the  optimal  filter  —  either  as  an  equation  governing  a  measure-valued  process 
or  as  a  stochastic  partial  differential  equation  for  the  conditional  density  -- 
has  been  investigated  by  several  authors  (Kunita  [17],  Szpirglas  [24],  Pardoux 
[20,21],  Krylov  and  Rozovskii  [lb].  Also  see  further  references  listed  in  [16]). 
An  equivalent  but  more  convenient  equation  to  work  with  is  the  one  for  the  unnor¬ 
malized  conditional  expectation  (or  conditional  density)  due  to  Zakai  [25]. 
Recently,  Clark  and  D  avis  have  used  the  Kallianpur-Striebel  (K-S)  formula  and 
also  the  Zakai  equation  to  obtain  a  robust  solution  to  the  filtering  problem 


2 


([5], [6], [7]). 

A  point  of  view  which  questions  the  practical  validity  of  the  observation 
model  (0.1)  has  been  put  forward  by  Balakrishnan  in  a  series  of  papers  which  are 
the  forerunners  of  the  present  work  [1,2,3].  According  to  him,  the  model  (0.1) 
is  not  suitable  from  a  practical  standpoint  because  the  results  obtained  cannot 
be  instrumented  [3].  While  this  objection  to  the  applications  of  the  Wiener  pro¬ 
cess  in  physical  problems  may  not  be  new,  Balakrishnan  goes  further  in  insisting 
that  the  theoretical  framework  for  nonlinear  filtering  must  be  faithful  to  the 
observed  phenomena  which,  in  the  present  situation,  means  working  with  a  Hilbert 
space  of  possible  observations  that  has  Wiener  measure  zero.  This  model  which  we 
designate  the  white  noise  filtering  theory  model  is  rigorously  defined  in  Section 
2.  The  noise  in  the  observation  is  modeled  not  by  the  Wiener  process  but  by  fi¬ 
nitely  additive  Gaussian  white  noise.  The  latter  is  the  same  as  the  Gaussian  weak 
distributions  that  were  first  introduced  by  Segal  in  connection  with  certain  pro¬ 
blems  of  Quantum  Physics  [22].  Nonlinear  transformations  involving  weak  distribu¬ 
tions  were  also  studied,  somewhat  later  by  Gross  ([12].  See  the  comments  in  [1]). 

The  aim  of  the  present  paper  is  to  further  develop  the  white  noise  approach 
to  nonlinear  filtering  in  the  important  special  case  when  the  signal  is  indepen¬ 
dent  of  the  observation  noise.  We  begin  by  setting  up  the  necessary  finitely  ad¬ 
ditive  framework  for  our  problem  in  the  first  three  sections.  These  include  the 
white  noise  versions  of  the  Bayes  (or  K-S)  formula  and  of  the  Zakai  equation.  In 
Section  4,  another  form  of  the  Zakai  equation  --  a  partial  differential  equation 
for  the  unnormalized  conditional  density  (in  the  finitely  additive  context)  --  is 
derived  and  the  existence  and  uniqueness  of  its  solution  in  the  distributional 
sense  is  established. 

The  last  section  is  devoted  to  robustness  questions  and  to  the  relation  be¬ 
tween  the  white  noise  and  Tto  calculus  approaches  to  the  subject.  Theorems  5.1 
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and  5.2  show  that 

(a)  the  white  noise  theory  leads  to  a  robust  procedure  when  the  observations  are 
restricted  to  the  HilbeTt  space  and 

(b)  the  robust  solutions  obtained  by  Davis  in  the  standard,  Ito  formulation  of 
the  problem  can  be  approximated  by  the  solutions  in  (a).  Further  details  are 
given  in  Section  5. 

It  is  not  easy  to  make  a  strict  comparison  of  the  results  of  Sections  4  and  5 
with  those  of  Pardoux  [20,21).  Under  somewhat  weaker  conditions  than  ours  Pardoux 
has  shown  that  the  unnormalized  conditional  density  p^fx.Y)  is  the  unique  solution 
in  the  distributional  sense  of  a  stochastic  PDE  ([21],  Corollary  3.3  to  Theorem 
3.1).  In  Theorem  3.2  of  [20]  it  is  shown  that  for  every  Y  in  C[0,t],  p^fx.Y)  is 
the  solution  in  the  distributional  sense  of  a  PDE  which  is  the  analogue  of  the 
"Zakai"  equation  of  Section  4.  On  the  other  hand,  the  conclusions  of  Theorems 
4.2  and  5.1  of  our  paper  are  stronger.  Theorem  4.2,  for  instance,  cannot  be  de¬ 
rived  as  a  consequence  of  Pardoux 's  results  or  by  using  his  methods.  Moreover, 
all  our  theorems  are  pathwise  results  which  cannot  be  obtained  with  the  technique 
of  stochastic  calculus. 

In  connection  with  his  work  on  robust  filtering,  Davis  has  introduced  a  semi¬ 
group  of  transformations  in  which  the  path  Y  £  C([0,T],F^)  figures  as  a  parameter 

[7].  An  analogous  semigroup  T*  (s<t)  but  with  y  restricted  to  H  is  defined  in 

S  j  t  I 

Section  3.  The  brief  discussion  given  there  shows  that  the  absence  of  stochastic 
formalism  makes  things  simpler  and  permits  a  more  general  definition. 

The  theory  outlined  in  this  paper  can  be  extended  to  infinite  dimensional  pro¬ 
blems.  In  this  case  the  Hilbert  space  Hy  is  replaced  by  a  Hilbert  space 
2 

. L  (fO,T];K)  of  square  integrable  functions  taking  values  in  a  separable  infinite 
dimensional  Hilbert  space  K.  Certain  aspects  of  this  problem  will  be  taken  up  in 
a  later  paper. 
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Finally,  it  must  be  mentioned  that  the  main  results  in  this  paper  have  been 
established  without  any  approximation  procedure  whatever.  In  this  sense,  the 
white  noise  approach  stands  by  itself  and  is  entirely  different  in  spirit  from 
the  many  attempts  to  find  approximations  to  Ito  stochastic  differential  equations 
and  their  solutions  discussed  at  length  in  Ikeda  and  Watanabe's  book  [131. 


] 

I 

t 

L .  A  Bayes  formula  for  a  finitely  additive  filtering  model. 

We  start  by  recalling  some  definitions  regarding  integration  with  respect  to 
cylinder  measures  on  Hilbert  spaces.  These  definitions  are  implicit  in  dross 
[12]. 

Let  H  be  a  separable  Hilbert  space.  Let  P  be  the  class  of  projections  on  M 
with  finite  dimensional  range.  For  PeP,  let  Cp  =  {P  *B:  B  a  Borel  set  in  Panne 
P}  and  let  C  =  u{Cp:  PeP).  Then,  Cp  is  a  a-field  for  each  PeP  and  hence  C  is  a 
field.  The  sets  in  C  are  called  cylinder  sets.  A  cylinder  measure  n  on  H  is  a 
finitely  additive  measure  on  (H,C)  such  that  its  restriction  to  Cp  is  countably 
additive  for  all  PeP. 

Let  L  be  a  representative  of  the  weak-distribution  corresponding  to  the 
cylinder  measure  n.  This  means  that  L  is  a  linear  map  from  H*  (identified  with 
Hj  into  L((l1  Aj.TTj)  -  the  linear  space  of  all  random  variables  on  a  countably 
additive  probability  space  (fi^A^IIj)  -  such  that 

(1.1)  n(h:  ((h,hj) ,(h,h2), . . . ,  (h,hk))  e  B) 

=  ni(CLCh1) ,LCh2) .  L(hk))  e  B) 

V 

for  all  Borel  sets  B  in  F  ,  h; .  1^  £  H  and  k  >  1.  (Two  maps  L,L'  are  said 

to  be  equivalent  if  both  satisfy  (1.1)  and  the  equivalance  class  of  such  maps  is 
the  weak  distribution  corresponding  to  n) . 

A  function  f  on  H  is  called  a  tame  function  if  it  is  of  the  form 

(1.2)  f(y)  =  ^((y.hj),...,  (y,hk)) 

k 

for  some  k  £  1,  h^,...,  hk  e  H  and  a  Borel  function  4>:F  -*■  F  .  For  a  tame  func¬ 

tion  f  given  by  (1.2),  denote  by  f ~  the  function  4>(L(h^)  ,L(h2) , . . . ,  L(hk)). 

Definition.  Let  L(H,C,n)  be  the  class  of  functions  f  on  H  such  that  the  net 
{(foP)~:  PeP)  (here  Pj  <  P2  if  Range  P1  £  Range  P2)  is  cauchy  in  -measure. 


Furthermore,  let 


f"  =  lira  in  Prob.  (f°P)~. 

PeP 

It  may  be  observed  that  "the  map  g  -*•  g"~  is  an  extension  of  that  for  tame 
functions  and  is  linear,  multiplicative  while  the  intrinsic  integration  notions 
such  as  the  distribution  of  g~  depend  only  on  the  function  g  and  the  weak  distri¬ 
bution  (or  the  cylinder  measure)  and  is  independent  of  other  arbitrary  choices 
(such  as  the  representative  L  of  the  weak  distribution  or  of  a  basis  of  H,  etc.)." 
(Gross  [12]). 

In  view  of  this  remark,  it  is  natural  to  make  the  following: 

Definition:  f  e  l(H,C,n)  is  integrable  iff  f~  is  integrable  and  in  that  case, 
we  set 

/  f  dn  =  f  f'  dllj  . 


The  finitely  additive  cylinder  measure  m  on  (H,C)  such  that 


(1.3)  m{yeH:  (y,h)£a}  =  exp(-  — j)  dx  ,  h  e  H 

v^HMI  2[|h|r 

is  called  the  eannoniaal  Gauss  measure  on  H.  The  identity  map  e  on  H,  considered 
as  a  map  from  (H,C,m)  to  (H,C)  will  be  called  Gaussian  white  noise. 

The  abstract  version  of  the  white  noise  non-linear  filtering  model  considered 
in  this  paper  is  given  by 

(1.4)  y  =  C  +  e 


where  (  is  an  H  valued  random  variable  on  a  countably  additive  probability  space 
(ft, A, IT)  independent  of  e.  To  make  (1.4)  meaningful,  £  and  e  should  be  defined  on 
a  single  probability  space.  To  this  end,  let  E  =  H  x  ft  and 

F  =  pUp  Cp  ®  * 

where  Cp  «  A  is  the  usual  product  a  field.  For  PeP,  let  otp  be  the  usual  product 
of  m  restricted  to  Cp,  which  is  countably  additive  and  IT,  (so  that  ap  is  a  counta¬ 
bly  additive  probability  on  (E,C  «A)).  It  is  easy  to  see  that  the  a  's  are  con- 

P  v 

sistent  and  thus  determine  a  finitely  additive  probability  on  P  such  that  a  =  a 


on  Cm  A. 

P 


Let  e,£,y  be  H  valued  maps  in  E  defined  by 

(1.5)  e(h,U))  =  h 

£(h,Od)  =  £(o>) 

y(h,uj)  =  e(h,u>)  +  £(h,w)  ,  (h,uj)  e  H  x  n  . 

Then,  £  is  the  signal,  e  is  the  noise  and  the  observation  y  is  given  by  (1.4). 
Lemma  1.1.  y:  (E,F,a)  ■*  (H,C)  is  measurable  in  the  sense  that 
B  =  {(h,u)):  y(h,u))eC}  belongs  to  F  for  all  C  e  C  and  further  n  defined  by 

(1.6)  n(C)  =  ct(y  e  C) 

=  /  mCfaj)  (Qdnfw)  ,  CeC 

(where  m^fC)  =  m(C  -  h) ,heH)  is  a  cylinder  measure. 

Proof:  Fix  PeP  such  that  CeCp.  Let  C  =  P  1D,  where  D  is  a  Borel  set  in 
Range  P.  Then 

B  =  {(h,oj):  P£(«)+PheD)  e  Cp  »  A  c  F  , 

Furthermore,  using  Fubini's  theorem, 

a(yeC)  =  otp{(h,u>):  PC(w)+PheD} 

=  /  m[P-1  (D-P£  (u>) )  ]dH(u)) 

=  /  m(P~1D-C(aj))  dIKu) 

=  /  m(C-C(<i>))  dll(a)) 

=  /  dlt(u>)  . 

The  finitely  additive  measure  n  is  called  the  distribution  of  y. 

Let  g  be  an  integrable  function  on  ((2, A, II).  In  analogy  with  the  usual  defi¬ 
nition  of  conditional  expectation,  we  make  the  following 
Definition:  If  there  exists  a  v  e  L(H,C,n)  such  that 

(1.7)  /  g(u))lc(y(h,u>))da(h,u>)  =  JQ  v(y)dn(y) 

then  we  define  v  to  be  the  conditional  expectation  of  g  given  y  and  express  it  as 


E(g|y)  =  v  . 

As  in  the  proof  of  lemma  1.1,  it  can  be  seen  that  the  integrand  in  (1.  7)  is 
Cp»A  measurable,  where  CeCp.  Let 


(1  -8) 


♦  (C)  =  /  g(u)l  r(y(h,w))da(h,u>)  ,  C  c  C„. 


Again,  as  in  lemma  1.1,  it  can  be  shown  that 


(1.9) 


♦  (C)  =  J  g(w)m£-^  (C)dnf(o)  . 


We  now  proceed  to  y*  the  existence  of  E(g|y).  In  fact  we  obtain  an  ana¬ 
logue  of  the  Kal  1  ianpur-Striebel  formula  for  E(gjy).  Since  we  are  going  to  use. 
it  later,  we  state  an  abstract  version  of  the  formula  here.  For  a  proof  of 
this,  see  lemmas  11.3.1  and  11.3.2  in  Kalliannur  [15]. 

Lemma  1.2.  Let  (Q.,A^,IL),  i  =  1,2  be  probability  spaces,  and  (u^  ,A_ ,!!_')  = 
C ' i j , A^ , TI ^ ® f . ^2 , TI 2 )  -  Let  pCw^.u^)  be  a  positive  A, -measurable  function  such 


j  p(w1,a)2)dni(a)1)  =  1  for  all  co2  <r  ft,,  . 

Let  X  be  a  measure  on  (ft.A^)  defined  by 

X(B)  =  j  p(u>  ,u)  )dlt  (to  ,o)  )  ,  B  e  A  . 

Bx^2  1  *  J 

Let  Aq  =  A1®{(j),fi2}. 

Let  g  be  an  integrable  function  on  (ft2,A2,H2)  and  let 

0  (B)  =  /  g(u>  )p(u>  ,u>  )dn  (w  ,w  )  . 

8  B*n2 

Then  Q  <<  X  and 

g  , 

dQ„  J  g(o>,)p(u>  ,w  )dn  (u)  ) 

E(glA0)  -  -g£<«0  =  — ~ - — - — 

0  dA  1  /  pCa)1,a>2)dn2(u2) 

Now,  let  {e^}  be  a  CONS  in  H.  Let  8  be  the  Borel  a-field  on  F  and  let 

p  =  TI.  N(0,1).  For  h  e  H,  let  p,  =  II.  ,  N(h.,l),  where  h.  =  (h,e.).  It  is 
.1-*  n  j=i  j  .1  i 
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well  known  that  y^  =  y  Ci.e.,  y^  <<  y  and  y  <<  y^) 


and 


dy. 


(1.10) 


ay  ao  oo 

-^(x)  =  exp(  I  x  h  -  y  I  h  )  , 
i=l  1=1 


x  =  (x1,x2,...)£F 


=  q(x ,h)  say. 

.  OO  00 

Let  Xj  (x)  =  Xj ,  x_  =  (Xj.x^,...)  f  R  ,  be  the  coordinate  maps  on  1R  .  Defi 

00  oo 

a  map  L  from  11  into  L(R  ,  8  ,  y)  by 


ne 


(1.11) 


L(h)(x)  =  l  <h,e.>X  (x)  . 
j  =  l  -1  J 


Tt  is  easy  to  check  that  the  series  appearing  in  (1.11)  converges  a.e.  y  and  that 

the  distribution  of  L(h)  under  y  is  N(0,||h||  ),  so  that  L  is  a  representative  of 

the  weak  distribution  corresponding  to  the  Gauss  measure  m.  Since  u,  =  y,  L  can 

0 

also  be  considered  as  a  map  into  L(R  ,8  ,vl  )  and  the  distribution  of  L(h)  under 

2  "o 

y^  is  N((h,hg)  ,||h||  )  and  thus  is  a  representative  of  the  weak  distribution  cor¬ 
responding  to  nr  .  Further,  y  =  y  implies  that  the  map  f  +  f'  is  the  same  when 
h0  n 

L  is  considered  as  a  representative  of  the  weak  distribution  corresponding  to  m 

or  m  .  Thus  for  C  e  C,  we  have 
h 

0 


(1.12) 


Vc)  =  Vco  '  ho e  H 


where  C  e  B  is  given  by 

(1.13) 


=  1C~  ' 


Now ,  1 et 

(01>A1,ni)  =  ( R , 8°° , y)  a  (P,A,H) 

v(B)  =  J0  q(x,C(w))dni(x,to)  ,  B  e  Aj 
X(D)  =  v(Dxfi)  ,  D  e  8°°  . 
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(1.14)  V  CD)  =  JD  g(aj)lD(x)dv{x,u)) 

afg.x)  =  /  g(<a)q(x,£(io))dn(w)  . 

Then,  by  Kali ianpur-Striebel -Bayes  formula.  Lemma  1.2,  we  have 

dtp  -  . 

-r^-(x)  =  =  u(x)  say. 

d  0(1,  x) 

and  hence 

(1.15)  H'^(D)  =  jD  ufx)dX(x)  ,  D  e  8°°  . 

The  following  lemma  is  an  immediate  consequence  of  (1.6),  (1.8),  (1.12), 
(1.14)  and  Fub ini's  theorem: 

Lemma  1 . 5 

(1.16)  (i)  v(AxB)  -  jB  yf(u)(A)dn(aJ)  ,  A  e  B°°,  B  e  A 

(ii)  Y  (D)  =  /g(a))yC(aj)(D)dn(u)  ,  D  e  8°° 

(1.17)  (iii)  n(C)  =  X(C~)  ,  C  €  C  and  C”  is  defined  by  (1.13) 

(1.18)  (iv)  4>  (C)  =  Y  (C~)  ,  C,C~  as  in  (iii). 

g  g 

Now,  (1.16),  (1.14)  and  the  fact  that  r  p  implies  that  X  =  p  and  hence  L 

°o  oo 

can  be  considered  as  a  map  into  L(1R  ,B  ,\).  (1.17)  implies  that  it  is  a  repre¬ 

sentative  of  the  weak  distribution  corresponding  to  the  cylinder  measure  n. 

Also,  as  remarked  earlier,  the  map  f  -+•  f~  is  the  same  for  the  cylinder  measures 
n  and  m.  Thus,  even  though  we  have  a  family  of  cylinder  measures  {m^:  heHlu(n), 
the  symbol  f~~  has  a  unique  meaning. 

Let 

(1.19)  q(y,h)  =  exp{ (y,h)  -  y||h||2} 
and 


(1  .20) 


o( g,y)  =  /g(<a)q(y,S(u>))dIl(uj)  . 
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1  2 

Observe  that  q(y,h)  s  exp(j||yll  )  and  hence  the  integral  appearing  in  (1.20) 


is  well  defined. 


Finally,  let 


Then,  we  have 


V(y)  =  — qls  >1) 

m  0(1, y) 


Lemma  1 . 4 


(i)  v  €  L(H,C,n) 


(ii)  v~  =  u 


(iii)  v  satisfies  (1.9),  i.e., 

L  v  dn  =  4>  (C)  for  all  C  e  C 
JC  gv 

Proof:  Let  denote  the  projection  onto  span  (e^,...,  e^),  k  >  1.  (Recall  that 

{e.}  is  a  fixed  CONS  in  H)  .  Let  8  =  o(X.:  1  <  j  <,  k)  .  Let  h  denote  a  generic 

J  k  3 

element  in  H  and  let  h.  =  (h,e.).  Then 

J  1 

k  00 

(1.21)  [a(g,Pkh)]~  =  [/  g(w)exp( [  h.?.-  \  \  ^)dn(w)]~ 


=  /  g (to) exp(  l  U.-  \  l  C2)dIT(co) 
i=l  i=l 


On  the  other  hand. 


1  c 


(1.22)  E  (a(g,*)|8.)  =  /  g(to)exp(  £  X.£.-  7  l  C)dll(w) 

y  K  i=l  11  £  i=l  1 

Denoting  /  |f|dy  by  ||f||^,  we  get  from  (1.21)  and  (1.22) 

k  k  00 

II  (a(g,  P.h)  ]~-E  (0(g,  •)  (8. )  ||  =  Fy/[(g(a))|exP(  £  X  ?  -  ±  l  d)(l-exp(-  \  \  5?)ldn(u0 

M  i=l  i=l  i=k+l 


(1.23) 


-  /  |g(uO|(l-exp(-  |||C||2+  7l|PkC||2)dn(co). 


If  for  P  e  P,  8p  denotes  a(L(h) :  h  e  Range  P) ,  then  as  in  (1.23),  it  can  be  shown 
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Cl-24)  II  [afg.Ph)r-Ev(afg.)|8p)||  j 

=  /  |gfu)|Cl-expf-  j||S||2+  j||p£||2))dJ!(w) 
By  the  martingale  convergence  theorem  (see  p.  96,  Neveu  [191),  we  have 


(1.25) 


{EyCofg,*)  |8p)>pep  o(g,‘)  in  L1  (u)  . 


From  (1.24)  and  (1.25),  we  get 

( [^(g.Ph)  ]~}pep ->  a(g,  •)  in  L1  (y) 

and  hence  in  y-probabil ity.  Thus  a(g,h)  e  L(H,C,n)  and  [o(g,h)]''  =  o(g,*l. 
the  map  f  -*•  f~  is  multiplicative,  we  get  v  e  L(H,C,n)  and 

v~  -  [o(g.h)]~ 

'  (a(l ,h) ] ~ 

_  o(g,x) 
a(l, x) 


Since 


Finally,  for  C  c  C  and  C~  defined  by  (1.13),  we  have 


Yc)  =  Yr) 


by  (1.18), 


=  /c~u(x)dX(x)  by  (1.15), 


«  /  (lrv) ~dX 


=  /  l„v  dn 


by  definition  of  / fdn 


=  /c  vdn  .  □ 

We  have  shown  the  existence  of  a  function  v  satisfying  (1.7)  and  thus  E(g|y) 

exists  as  a  function  in  i.(H,C,n).  We  may  remark  that  here  4>  and  n  are  cylinder 

8 

measures  such  that  for  all  e  >  0,  there  exists  a  <5  >  0  such  that  n(C)  <  (5  implies 
<tg(C)  <  e  for  C  e  C.  However,  there  is  no  general  result  known  to  us  that  implies 


d<f> 

O 

the  existence  of  the  Radon -Nikodym  derivative  in  the  class  J.fH,C,n).  What 
we  have  done  in  Lemma  1.4  is  to  produce  such  a  derivative.  We  may  add  that 
Gross  fill  has  a  definition  of  Radon-Nikodvm  derivative  which  always  exists  as 
in  the  countably  additive  theory,  but  in  his  definition,  the  derivative  is  a  ran- 

CO  00 

dom  variable  on  the  Representation  space*  ((11,0  ,X)  in  our  set  up}  and  not  on 
fH.C.ni  and  thus  is  not  appropriate  for  defining  conditional  expectation. 


We  summarize  the  results  of  this  section  below.  This  is  Bayes  formula  for  th 
conditional  expectation  in  our  set  up  analogous  to  the  Kallianpur-Striebel  formul 
(Lemma  1.2). 

Theorem  1.1:  Let  y,£  be  as  in  (1.5).  Let  g  be  an  integrable  function  on  (ft.  A,  If) 


E(g|y) 


/  g(<A)exp((y,£(u>))-  j||5(go)  ||  2) dll(co) 
/  exp(y,£(w))-  I||e(w)||  2dH(to) 


Then 


2.  A  white  noise  model  of  non-linear  filtering 


In  this  section,  we  apply  the  results  of  the  previous  section  to  the  usual 
filtering  model. 

Let  X  =  {Xs:  0  <  s  s  T)  be  a  Markov  process  on  a  probability  space  Cfl.A.II) 
and  taking  values  in  a  complete  separable  metric  space  S  and  let  ( Lg)  be  the  ex¬ 
tended  infinitesimal  generator  of  X  and  let  V  =  nP(i.g). 

For  0  <  t  <  T,  let  h^:  S  •+•  be  such  that 

(2.1)  /J  |ht(Xt)|2  dt  <  «  a.s.  n  . 

Writing  (  ,  )  for  the  inner  product  in  Fm  and  |*[  for  the  norm,  for  each  t 
in  [0,T]  define 

Ht  =  {<J>  :  [0,t]-Hm  :  |4>|eL2([0,t])}  . 

H  is  a  [filbert  space  with  the  inner  product 

'VV  =  fo  (V>2}  du  • 

A  function  <}>  e  Hj,  when  restricted  to  [0,t]  will  be  denoted  by  and  is  obvi¬ 
ously  an  element  of  . 

Let  Cg(w)  =  hgfXgCw)) ,  0  <  s  <  T.  In  view  of  (2.1),  £  =  (£g)  is  an  H  valued 
random  variable  and  for  0  <  t  <  T,  ^  is  an  H  valued  random  variable.  Let 
e  =  (eg:  0  5  s  <  T)  be  H(  valued  'white  noise'  independent  of  X. 

The  non-linear  filtering  model  considered  in  this  paper  is  the  following: 

(2.2)  y  =  h  (X  )  +  e  0<s<T 

s  s  s  s 

or  equivalently 

(2.2) '  y  =  5  ♦  e  . 

Applying  the  results  of  the  previous  section  (to  the  Hilbert  space  H  and  random 
variables  y*  =  £*  +  e*)  we  have,  for  an  integrable  function  g 

E(g| y  -  OSuSt)  =  ■  ,  yteH 

U  o(l, yr)  1 


j 


(2.3) 


2 


where 

C2.4)  ofg.y^  »  /  gCo))q(yt,CtCo)))dnf u) 

and 

(2-5)  qCy*. €*(<*>))  =  exp(<yt,Ct(i»)')>t  -  ^<^(0*)  ,Ct(w1>t)  . 

Since  q(yf, £*(«))  <  exp(^<yt,yt>)  ;  afg.y1")  is  well  defined. 

If  we  define  ot(g,y)  =  csCg.yS  and  qt(y,Q  =  qfy*,?*)  then  we  can  rewrite  (2.3) 
and  (2.4)  as 

at(g,y)  =  /  g  (a))  qt  ( y ,  £  (w) )  dll  (to) 

,  °t(g,y) 

E(g|yu:  0<u<t)  =  ,  ycHT  . 

It  may  be  remarked  that  though  yeHT  appears  as  a  parameter  in  a^Cg.y),  ot(g,y) 
is  by  its  definition,  a  nonant icipative  functional  of  y.  We  prefer  the  form  (2.3)' 
because  in  the  next  section  we  obtain  a  differential  equation  for  a^fg.y)  and  for 
this,  it  is  convenient  to  consider  a  fixed  yeHT  as  the  parameter. 


(2.4)’ 

and 

(2.3)’ 


3.  A  white  noise  version  of  the  Zakai  equation. 

In  this  section,  we  obtain  an  analogue  of  the  Zakai  equation  in  our  set  up. 
We  sill  show  that  (under  suitable  conditions)  for  feP,  we  have 


(3.1) 


It  Wy)  =  VEtft>y)  + 


where  fjw)  =  f(Xt(w))  ,  hjto)  =  l»t(X  (a>)) 

Ltf  =  Ltf  '  llht|2f  *  and  Lt7t  =  Citf)  CXt(oi))  . 

First,  observe  that  for  a  g  such  that  E |  g |  <  00 , 

(3.2)  Ot(g.y)  =  °t(E(g|^),y) 

where  FX  =  cr(X(s) :  s<t) .  This  follows  from  (2.4)'  and  the  fact  that  qt(v,£(w)) 

x 

is  F  measurable, 
t 

Fix  feV  and  let  g^:  fMR  be  defined  by 

8t(“)  =  fCXT)  -  /J(lsf)(Xs)ds  . 

Then  proceeding  as  in  [14]  it  follows  that 

(3.3)  E(g(.|F*)  =  f(Xt) 
and  hence 

(3.4)  crt(ft,y)  =  at(gt,y)  . 

Now,  g^.q^  are  absolutely  continuous  functions  of  t  and 

d  /  7 

dt  gt  ”  Ltft  ’ 

£  qt(y,^))  =  qt(y,«w)Hj  ^yj-  ||hsri 

and  hence 

gtqt(y*°  s  g0+^as7s)qs(y’C)ds+J{gsfrsys%fy'^ds 

■  I-/^s,frs|2qs(y'C)ds  • 


(3.5) 
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(3.6)  //J(US7SI  +  .i  l8sl  lhs!  Iys I  +  T^s^^s^  dsdn  4  00 
then  by  Fubini's  theorem  it  follows  that 

(3.7)  ot(gt,y)  =  Eg0  +  /0[asCi.sfs,y)  +  Wsfgs^.yly^  -  I  ag  fgs  |hg  |  2, y)  )ds 
so  that  (3.2),  (3.3),  (3.4)  and  (3.7)  give 

(3.8)  at(ft,y)  =  Eg0  +  /0tasasfs,y)  +  j  ^(f^.y)^  -  ?  a  (¥  |h  | 2,y)  ]ds 

i=l 

t  m _ . 

=  Ef0  +  /o[as^sfs’y)  +  • 

Equation  (3.8)  is  the  integral  version  of  (3.1)  and  thus  it  follows  that  (3.1) 
holds  for  all  feP  for  which  (3.6)  holds  . 

From  now  on,  we  will  consider  o  (g,y)  only  for  g  of  the  form  g(w)  =  f ( X  (uj)  ) 

L  L 

and  hence  we  modify  our  notations  slightly. 

For  a  function  f:  S  •+  1R  and  0<t^T,  such  that  E  |  f ( X^_ )  |  <  «*<  ,  we  define 

ot(f,y)  *  /  f(Xt)qt(y,C(u))dn(u)  . 

With  this  notation,  we  now  state  the  result  proved  in  this  section. 

Theorem  3.1.  Let  feV  be  such  that  (3.6)  holds.  Then  for  all  yefij,, 


(3.9) 


^ot(f,y)  =ot(Ltf,y)  ♦  ^  at(hjf,y)yj  . 


a.e.  t  in  [0,T]  . 

Remarks.  To  compare  (3.9)  with  the  usual  Zakai  equation,  define 

Ht  =  (Y:  Y  (u)  =  /“  y(s)ds  ,  OsuST  ,  y  e^} 

The  map  y  -+  Y  =  y(u)du  is  an  isomorphism  between  the  Hilbert  spaces 
and  (equipped  with  the  appropriate  inner  product).  Writing  ot(f,Y)  =  ot(f,v) 
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where  Y  and  y  are  related  as  above,  (3.9)  can  be  rewritten  as 

m 

(3.9)'  dOt(f,Y)  =  at(Ltf,Y)dt  +  l  0  (fh*  Y)dY*  . 

i  =  l 

(3.9)  '  is  an  analogue  of  the  Stratonovich  version  of  the  Zakai  equation  (Davis 
and  Marcus  [8]). 

The  unnormalized  conditional  expectation  a  ( f;y)  and  the  Zakai  equation  (3.9) 

can  be  expressed  by  means  of  a  semigroup  (Ty  )  ,  0<s<t<T,  yeH  as  follows. 

s ,  t  t 

Write 

q^Cy,S(w))  =  exp[/^(yu<?u(iu))du  -  j  f*  |  ?u(a>)  |  2du]  ,  0<s<t<T  . 

s 

Then  for  each  yeH^,  qt(y,*)  is  a  multiplicative  functional  of  the  Markov  process 

X  (recall  that  £  (w)  =  h  (X  (co) ) )  .  Hence  T^  .  defined  by 
u  u  u  s,t  J 

(3.10)  (TyS't  f)(x)  =  E[f(Xt)q*(y,£)|Xs=x] 
is  a  semigroup,  i.e.,  for  each  y  in  HT, 


(3.11) 


Ty  Ty  =  Ty 
s,u  u,t  s,t 


0<s<u<t<T  . 


Furthermore , 

(3.12)  /  d(Tj  t  f)(x)  dT(x)  =  Ot(f,y) 
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where  T  is  the  distribution  of  XQ.  In  this  set  up,  the  Zakai  equation  (3.9)  can 
be  written  in  the  form 


CJ.m  «  •  ro.t'£tf  ♦  o.t.yt)fl  • 

From  the  semigroup  property  and  (3.13)  it  can  be  shown  that  the  generator  Ay  of 

{Ty  }  has  the  form 
s,t 

Aj>  =  Lt f  ♦  [(ht,yt)  -  y|ht|2]f  . 


(3.14) 
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The  semigroup  (T^  t)  is  similar  to  but  not  identical  in  definition  to  the  semi¬ 
group  introduced  by  Davis  in  [7],  The  objective  of  [7]  is  to  establish  a  formula 
analogous  to  f 3 . 14)  for  the  extended  generator  of  Davis's  semigroup,  the  underlying 
idea  being  that  the  semigroup  determines  the  unnormalized  optimal  estimate  o^ff.yl. 
The  white  noise  approach  appears  to  be  more  general  and  is  considerably  simpler. 

The  assumption  that  hfX^)  is  a  semimartingale,  made  in  [7]  is  unnecessary.  More¬ 
over,  the  difficulties  connected  with  the  It o  stochastic  integral  with  respect  to 
the  semi mart in gale  hfX^)  simply  do  not  appear  in  our  treatment  of  the  problem.  We 
will  not  pursue  the  question  further  in  this  paper. 


-  i  -  ■ 
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\ .  Zakai  equation  for  the  unnormalized  conditional  density. 

In  the  rest  of  the  paper,  we  show  the  existence  of  the  "unnormalized  condi¬ 
tional  density"  in  our  finitely  additive  set  up  and  show  that  it  is  a  solution  of 
a  partial  differential  equation  (also  called  the  Zakai  equation) ,  in  which  yeH^ 
appears  as  a  parameter  in  the  coefficients. 

For  yeHj,,  let  be  the  finite  positive  Borel  measure  on  S  defined  by 

(4.1)  r*(B)  =  /  lB(Xt(u))qt(y,C(u))dn(u>) 

=  ot(lB,y) 

Then,  using  usual  arguments,  it  can  be  shown  that  for  f  such  that  E |  f  (X^.)  1  <  °°, 

It  2 

(recall  that  qt(y,£(w))  is  bounded  by  exp(j/Q|y|  du)) 

(4.2)  at(f,y)  =  /f(x)dr*(x) 
and  hence 

E(f (X  )  |y  :  0<ust)  =  — J—  /f(x)drVx)  . 

r  u  r(S)  y 

y 

Thus  F*  is  the  unnormalized  conditional  distribution  of  given  (v  :  0<u<t). 

If  the  X  is  Rd -valued  and  admits  a  density  Pt(x,y)  with  respect  to  the  Le- 
besgue  measure,  we  call  it  the  unnormalized  conditional  density  of  X^  given 
(y  :  u<t}. 

Theorem  4 . 1 

Assume  that  Xt  is  a  diffusion  process  with  state  space  Rd  ,  initial  density 
4>  and  generator  L  given  by 

Ltf(xl  =  .  I  .  f(x)  +  ?bi(t-x)^7f(x) 

i,j=i  j  i  i 

for  fec”(Rd)  , 

where  a(t,x)  =  ((a.  ..(t.x)))  is  positive  definite,  a.^t.x),  b^t.x)  are  bounded 
measurable  functions  from  [0,t]*ltd  -*R  and  further 


i 

t  —  - . .  MiaagBBM  ■■■■'• 


(4.3) 


i, j=l , • • -d  . 


lim  sup  sup  |a.,(t,x  )-a. .(t,x  ) |  =0 

S-0  |x1-x2i«S,  0<«T  13  1  1J  2 

Xj ,x2e F 


Then  there  exists  a  measurable  function  pt(x,y'),  0st<T,  xeF  .  ycH^  such  that 


(4.4) 


Ot(f,y)  =  /  f(x)pt(x,y)dx  ;  yeHT  , 


for  all  f  such  that  EjffX^)!  <  00  . 

Further,  pt(x,y)  satisfies  the  following  partial  differential  equation  (white 
noise  version  of  Zakai's  equation)  in  the  distributional  sense 

P0(x,y)  =  4>(x) 

(4.5) 

D  -s, 

Pt(x,y)  =  1-*  Pt(x,y)  +  (ht,yt)pt(x,y)  ,  yeH? 


where  L^f  =  L^f-  ylh^l^f 


1 ,  _d 


Also,  for  all  0^t<T,  y->Pt(*,y)  is  a  continuous  map  from  Hj.  into  L  ( F  ) 

1  00 

equipped  with  o(L  ,L  )  topology. 

Proof:  Let  Tt(B)  *  IKX^eB)  .  Then  (4.3)  implies  that  P*  is  absolutely  con¬ 
tinuous  with  respect  to  the  Lebesgue  measure  X  on  F^  (see  Stroock-Varadhan  [23), 
Theorem  9.1.9).  So  if  B  is  a  Borel  set  in  1R^  such  that  X(B)  =  0,  then  Tt(B)  =  0 
and  hence  from  (4.1)  r*(B)  =  0.  Thus  <<  X.  Also,  from  (4.1)  it  is  easy  to 
check  that(t  ,y)-*F*(B)  is  a  Borel  measurable  function  of  (t,y)  for  all  Borel  sets 

B  in  F^  .  Thus  we  can  choose  a  Borel  measurable  version  p  (x,y)  of  the  density 
dF* 

.  From  (4.2)  it  follows  that  the  density  Pt(x,y)  satisfies  the  required  con¬ 
dition  (4.4). 

From  the  definition  of  a^ff.y),  it  is  clear  that  y-K?t(f,y)  is  a  continuous 
map  for  all  t,  for  all  bounded  f.  Thus  y+p  (»,y)  is  a  continuous  map  from  H^. 

Id  1  00 

into  L  (F  )  equipped  with  the  o(L  ,L  )  topology. 

It  remains  to  show  that  p^  satisfies  (4.5).  For  this,  first  observe  that 
boundedness  of  a„,b.  implies  that  the  integrability  condition  (3.6)  holds  for 


3 


oo  d 

a]]  r  in  )  and  hence  (3.9)  is  valid  for  such  functions  f.  We  can  rewrite 

(3.9)  as 

(4.6)  /  f(x)pt(x,y)dx  =  /  f(x)<J>(x)dx  +  f*f(L^f)  (x)p^(x ,y)dxds 

+  j’^/f(x)(hs(x)  ,ys)ps(x,y)dxds 

for  feC'”(Kd)  . 

°0 

Now  for  feCpOR  )  (defining  pt(x.y)  =  0  if  t  t  [0,T])  we  have,  from  the 
definition  of  the  distributional  derivative, 

(4.7)  JJ  f(t.x)-^-  pt(x,y)dxdt  =  -fj  f  (t  ,x)  lpt(x,y)dxdt  . 

Applying  (4.6)  to  the  function  f(t,*)  and  integrating  with  respect  to  t 
over  1R  ,  we  get 

f4-8)  fj  f(t,x)-^r  pt(x,y)dxdt  =  -//  ~  f  (t ,  x)  r|)(x)  dxdt 

'//  f /q  LS0'X  f(t,x)ps(x,y)dsjdxdt 

"  H  t/o  Tt  fCt»xKhs(x)  ,ys)Ps(x,y)ds]dxdt 
=  1  4  1 1  *■  II I  say . 

Now,  by  Fubini’s  theorem  we  have 

(4.9)  I  =  -/  <Kx)[/  3“  f(t,x)dt]dx 

=  0 

(4.10)  II  -  -  //  [£  Ls ^  f(t,x)dt]ps(x,y)dsdx 

=  //  (Igf) (s,x)ps(x,y)dsdx 
=  //  f(s,x)[I*  ps(x,y)]dsdx  . 

Observe  that  f^f  h^(x)p^ (x,y)dxdt  =  F.Jq  h^(Xt)dt  •"  00  and  hence  again  by  iuhini’s 


theorem,  we  have 


4 


(4.11) 


111  =  '//  f(t,x)dt] (hg(x) ,ys)ps(x,y)dsdx 


=  //  f(s,x) (hg(x) ,ys)ps(x,y)dsdx  . 

Now,  combining  (4.8),  (4.9),  (4.10)  and  (4.11)  we  conclude  that  p  satisfies 
(4.5). 

Remark:  It  is  worth  noting  that  boundedness  of  h  is  not  required  in  Theorem 

4.1  . 


Having  proved  the  existence  of  the  unnormalized  conditional  density  pt(x,y) 
and  having  shown  that  it  is  a  solution  to  the  Zakai  equation  (4.5),  we  now  turn 
to  the  problem  of  uniqueness.  This  is  very  important  from  the  point  of  view  of 
appl ications. 

The  next  result  shows  that,  under  additional  conditions  on  the  coefficients 
a i i , h i ,  the  Zakai  equation  has  a  unique  solution  provided  y  is  restricted  to 
(i.e.,  yeH_  and  bounded)  and  h  is  assumed  to  be  bounded.  Thus,  the  boundedness 
assumption  on  h  occurs  only  in  the  uniqueness  part  of  the  problem.  More  inter¬ 
estingly,  the  two  theorems  together  show  the  following: 


(a)  Tf  yeL^,  the  Zakai  equation  has  a  unique  solution  given  by  the  unnor-,-  ted 
conditional  density  p  (*,y); 

(b)  Furthermore,  pt(*,y)  for  any  yeH^  can  be  obtained  from  (a)  and  by  approxima- 

00 

tion  in  view  of  the  last  assertion  of  Theorem  4.1  and  the  fact  that  L,p  is  dense 

in  H,p. 

Let 


V  =  (ueL2(Fd)  :  ^-eL2(Rd)} 

«L2(Fd)  means  that  the  distributional  derivative  is  given  by  an  L2-function) 


Let 


lullv  =  (/  |u|2dx  +  f  f  | 2dx) 
i  =  l  i 
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and 


I2([0,T],V)  =  (g:  [O.Tl-V:  [T  ||  *(t)  Hjdt<->  . 


f  Q  V« 

With  these  notations,  we  have  the  following  result. 

Theorem  4.2.  In  the  set  up  of  theorem  (4.1),  assume  further  that 

(4.12)  h  is  bounded 

(4.13)  Tc-^—a .  .  exists  and  is  bounded,  (a..)  is  uniformly  positive  definite, 

3x^  ij  ij 

2 

(4.14)  There  exists  p  (x)  belonging  to  L  ([0,T],V)  such  that 

(a)  Ef(Xt)  =  /  f(x)pt(x)dx 

(b)  sup [ J  |  J— f(x)  jp  (x)dx]dt  <  00  ,  where  S  =  (feC  (Hd)  ,  ||  f||  ,  s  ll 

0  feS  tXi  Z  L“ 

Then  for  yfL*,  the  unnormalized  conditional  density  pt(*,y)  is  the  unique 
solution  to 

(i)  P # ( • , y)  €  L2( [0 ,T] ,y) 

(ii)  p0(*,y)  =  *(x1 


(iii)  ^pt(*,y)  =  ^Ptc-,y)  +  (ht,yt)pt(*.y) 


Proof:  We  have  already  shown  that  p  satisfies  (ii)  and  (iii) 
that  it  satisfies  (i) .  Observe  that 

l|ptC*,y)||  2  =  sup|/  f(x)p  (x,y)dx| 

1  L  feS  * 


We  now  show 


(4.15) 


=  sup  |  0.  (  f ,  y) 
feS  * 


-  CT  v  SUP  /  !f(x) |pt(x)dx 
1,7  feS  * 

=  CT,y^Pt(*5  ^2 


IT  2 

where  CT  =  exp(j/0|y|  du)  . 
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Also  II  *'X)  II  2  =  suPl/  C-g|7f)Pt(x,y)dx| 

i  L  feS  i 

-  "UP 

feS  1 

~  CT  vSup^l-57-fIPt(x)dx 

i»yfe5  axi  t 

Now,  (4.15),  (4.16)  and  the  assumption  (4.14)  implies  that  p(* ,y)£L^( fO,T] ,V) . 

To  complete  the  proof,  it  remains  to  show  that  (i) ,  (ii),  and  (iii)  have  a 

00 

unique  solution.  For  this,  fix  ycL^  -  Let  A(t)  denote  the  bilinear  form  on  V 
defined  hy 

<A(t)u,v>  =  -  2.  aijft>x^<^r»-5rr> 


\  a.  (t,x)<-^ — ,v> 
l  i  '  3x.* 
i 


♦  tCht(x) ,yt)-  2^t(x) ]<u,v> 


where  a.(t,x)  =  b^(t,x) 


1  d  3 

j  £  ^-a..(t,x).  Thus  for  smooth  f, 
3=1  3  1J 


<A(t)f,v>  =  <Itf,v>  +  (ht,yt)<f,v> 
and  hence  (iii)  can  be  rewritten  as 

3ptC*,y) 

(iii)’  — ^ - =  A*(t)pt(-,y) 

Under  the  assumed  conditions  on  a,b,  (iii)'  has  a  unique  solution  with  the 

2 

boundary  condition  (ii)  in  the  class  L  ([0,T],V)  (See  Theorem  1.1,  Chapter  IV  of 


Lions  [18)).  As  remarked  earlier,  this  completes  the  proof. 


5.  Classical  solution  of  the  Zakai  equation  and  relationship  between  white  noise 

and  Ito  approaches  to  filtering.  " 

We  now  proceed  to  study  the  existence  of  solution  and  the  uniqueness  problem 
for  the  white  noise  version  of  the  Zakai  equation  (Eqn.  (4.5)).  Recall  that  in 

Theorem  4.2,  uniqueness  is  established  for  the  solution  in  the  distributional 

00 

sense  for  yeL^.  Moreover,  condition  (4.14) (b)  of  Theorem  4.2  appears  difficult  to 
verify. 

Theorem  5.1  solves  the  nonlinear  filtering  problem  in  the  white  noise  formu¬ 
lation.  Existence  and  uniqueness  of  the  classical  solution  to  Eq.  (4.5)  is  esta- 

v 

blished.  To  do  this,  we  use  the  transformation  used  by  Rozovskii  (see  Benes  and 
Karatzas  [4])  so  that  the  potential  term  in  the  transformed  equation  is  bounded. 

Our  next  result  forms  the  connecting  link  between  this  paper  and  the  current 
work  in  the  literature.  In  this  respect,  the  significance  of  Theorem  5.2  is  two 
fold:  First  the  white  noise  solution  of  the  nonlinear  filtering  is  robust  in  the 
following  sense.  The  data  is  represented  wholly  by  the  Hilbert  space  without 
any  reference  to  a  larger  space  C( [0,T] ,  IR^  )  on  which  a  countably  additive  measure 
can  be  defined.  If  Yn,YeHT,Yn->Y  in  the  norm  of  Hj,  then  from  Theorem  5.2,  it  fol¬ 
lows  that  p^.(x,Yn)->-p^(x,Y)  uniformly  over  compacts  in  [0,T]xFd  .  Secondly,  Theo¬ 
rem  5.2  shows  that  the  results  of  the  white  noise  filtering  theory  are  consistent 
with  those  of  the  theory  based  on  the  Ito  calculus.  In  other  words,  even  if  one 
is  interested  in  the  unnormalized  conditional  density  p^(x,Y)  of  the  conventional 
Zakai  equation  where  Y  is  now  any  path  in  C( [0,T] ,  1R^  ) ,  one  can  obtain  it  using 
the  white  noise  theory.  Since  there  exists  a  sequence  YneH^,  converging  in  uni¬ 
form  norm  to  Y,  the  unique  solutions  (for  each  n)  p^(x,Yn)  of  the  white  noise 
Zakai  equation  (4.5)  converge  uniformly  over  compacts  to  p^(x,Y) . 

Theorem  5.1.  Assume  that 


(I)  The  initial  density  <J>  is  a  bounded  continuous  function. 


(TI)  aij*  3^ij*  bi‘  hi'  O^i  are  bounded  LiPscbi 

continuous  functions  in  [0,T]xRd  . 

Then  for  all  yeH,j.,  the  unnormalized  conditional  density  ptCx,y)  is  the  unique 
classical  solution  to  the  Zakai  equation 


(5.1) 


3ptCx.y) 


L*Pt(x,y)  +  (htCx),yt)pt  fx,y)  , 


PQ(x,y)  =  <J>(x) 

12  <5 

in  the  class  of  C  ’  (lO,T]xF  )  functions  satisfying  the  growth  condition 


(5.2) 


U  d  |g(x) 1 expC -k 1 x |  )dxdt  <  *> 

F 


for  some  positive  constant  k. 

Furthermore,  for  all  ycH^,,  there  exists  a  constant  Cy  such  that 


0  s  pt(x,y)  <  C  , 


for  all  (t,x)e[0,T]xF 


Proof:  Let  =  (YeCC [0,T] ,  Fd)  :  Y t  »  jJ  y(u)du,yeHT) 


Write 


where 


L*f  =  l  a.  —  +  l  b!  ~  +  Cf 
t  L  ij3x.3x.  L  1  3x. 


.  d  3a.  . 

b!  =  -b.  +  —  \  and 

1  1  2  . L.  3x . 

1=1  1 

,  _  d  3b.  .  d  32a. . 

C  =  '  2|h'  ‘  +  2  .  I  ‘ 

i=l  1  i,j=l  1  1 

In  view  of  condition  (II)  it  is  easy  to  see  that  the  coefficients  a..,b!  and  c 

1  j  1 

are  bounded  and  Lipschitz  continuous.  Now  let 


(5.4) 


*t(x,Y)  =  e 


-(ht(x),Yt) 


Pt(x,y) 


where  Yu  =  y(T)dt cH^.  Using  the  fact  that  p  satisfies  the  Zakai  equation  of 


3 


Section  4  we  have 


W,Cx,Y)  -(Mx),Y.K  (h  fx),Yt) 

— ^ -  -  e  r  1  L* [ e  r  tKtCx.Y)l 


3ht 


(The  calculations  are  similar  to  those  in  Benes  and  Karatzas  [4]) 


Thus  we  have 


^0(x,Y)  =  <j>(x)  ,  and 


(5.S) 


-5^- <l>t  ( • » Y)  =  U^t(*,Y)  where 


(5.61 


Y  -(h  C *) »Y  ) ^  (h  ,Y  ) 
ujf  =  e  r  L*[e  T  r  f] 


Sh 

-  c-sf’  Vf  • 


The  first  term  on  the  right  hand  side  of  (5.6)  becomes  after  simplification 

3  ,,  m  9f 


il(  'l  i?  “ij^WiTsir 

i  J  J  J  i 


i  Vsrisr'VV  *  il:tht-VH7(hfvt1|r 


1  J 


From  (5.6)  and  (5.7), 


i b;  3^vvf  * cf  ■ 


"!fw  =  l  aij(,'x,W^r  *  i  b1ft>x-Y)5r 


3f 


ij  '  dx.cix.  i - *  ;9x. 

i  1  i 


+  c(t,x,Y) f 


where 


b  (t,x,Y)  =  b!  (t,x)  +  l  ai,(t,x)^-fht,Yt) 

j  J 


and 
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c(t,x,Y)  =  c(t,x)  -  -g^-Cht , Yt) 


+  l  a.  .  k-^-5 --Ch»,YJ  +  l  a..  J^-Or.YJ*-^-,,  v  , 

L  11  dX.dX.  t  t  L  11  dX.  t  t  dX.(h.,Y 
J  1  J  J1  ltt 


*  l  »i  H7(hfyt’ 


1 

For  fixed  Y  in  H^,  b^  and  (being  products  of  bounded,  Lipschitz  continuous  func¬ 
tions)  are  bounded  and  Lipschitz  continuous.  Finally,  the  boundedness  of  h, 

2 

!£,  -3x3  implies  that,  for  any  Y1  and  Y2  in  H^,, 

i  i  j 


(5.8) 


|b.  (t.x.Y^-b  (t,x,Y2)  |  <  K  sup  |Y*-Y2 
i  i  0<t<T 


|c(t,x,Y1)-c(t,x,Y2) |  <  K  sup  |yJ-Y2|  for  all  t,x 

0<t<T 


where  K  is  a  constant. 

It  follows  from  Theorem  12,  Chapter  1,  Friedman  [9]  that  (5.5)  has  a  solution 

_  17  H 

x 1>  in  C  ’  ([0,T]x«  )  and  that  i:  satisfies  the  growth  condition  (5.2).  We  now  need 
a  result  which  under  the  conditions  (I),  (II)  establishes  the  uniqueness  of  the 
distributional  solutions  of  (5.5).  (By  a  distributional  solution,  we  mean  a  func¬ 
tion  which  satisfies  (5.5)  in  the  distributional  sense).  Friedman  has  proved 
( f 9 ] ,  Theorem  16,  Chapter  1)  the  uniqueness  of  the  classical  solution  to  the  pro¬ 
blem  in  the  class  of  functions  satisfying  (5.2).  However,  an  obvious  modification 
of  his  arguments  (pertaining  to  the  use  of  Green's  identity  in  the  proof)  shows 
that  the  solution  is  unique  also  in  the  class  of  functions  satisfying  (5.2),  where 
(5.5)  is  taken  in  the  distributional  sense. 

It  is  easy  to  see  that 

/Jj  d  |<J>t(x,Y)|dxdt  <  C j/Jj  dpt(x,y)dxdt 

R  R 
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so  that  ij>  =  ip.  Thus  ipeC  ’  ([0,T)*F  )  and  is  the  unique  classical  solution  to 

12  d 

(5.5").  This  in  turn  implies  that  p#(*,y)<rC  ’  ( [0,T]*Fa)  and  is  the  unique  clas¬ 
sical  solution  to  (5.1). 

In  view  of  the  assumption  that  <f>  is  bounded  and  an  estimate  on  the  fundamen¬ 
tal  solution  of  (5.5)  (see  (6.12),  Chapter  1  of  Friedman  [9]),  it  follows  that 
( x , Y)  is  bounded  by  a  constant  (depending  on  Y )  and  thus,  p^_(x,y)  is 
bounded  by  a  constant  which  we  denote  by  C  . 

y 

Theorem  5.2.  Assume  that  conditions  (I),  (II)  hold.  Let 

P^(x.Y)  =  pt(x,y)  ,  YcHt  ,  Yt  =  /*  yudu  . 

Then  the  map  Y-+p^(*,Y)  from  H ^  into  C([0,T]*Fd,  1R+  )  has  a  unique  continuous  ex¬ 
tension  to  YeC(  [o,T] ,  Fd)  . 

Further  (p ' ( • , Y) :  YcC(  [0,T] ,  Fd)}  is  the  unnormalized  conditional  density  for 
the  filtering  problem 

yt  *  /;  *  st 

where  (8^.)  is  Fd  -valued  standard  Brownian  motion. 

Proof:  The  same  arguments  as  in  Theorem  5.1  imply  that  (5.5)  has  a  unique 
solution  4^(x,Y)  for  all  YeC( [0 ,Tl  ,  Fd)  .  Let  p^(x,Y)  =  i^(x, Y)  exp((Yt,ht(x) ) . 

Then  p^(x,Y)  is  the  unnormalized  conditional  density  for  the  conventional  filtering 
problem  (5.10)  (see  Theorem  3.2,  Pardoux  [20]). 

To  complete  the  proof,  we  will  show  that  Y-*p)(*,Y)  is  continuous  for 
Y^C( [0,T] ,  Fd)  .  The  uniqueness  of  the  extension  follows  from  the  fact  that 
is  dense  in  C([0,T],Fd)  . 

Now,  let  Yn,  YeC(  [0,T] ,  Fd  )  be  such  that  Y^+Y  uniformly.  Let  f^(t,x)  = 
^T-tfx,Yn^  and  f^t,x^  =  ^T-t^X,Y^ ' 

To  show  that  p^(x,Yn)->p^(x,Y)  uniformly  on  compact  subsets,  it  suffices  to 

show  that  if  (t  ,x  Wt,x)  then  p1  fx  ,Y  )-*-p?(x,Y),  which  is  the  same  as 
n  n  ’  t  n  n  t 

n 

f  (t  ,x  )-*f  (t  ,x)  . 
n  n  n  v  ’  ' 


f) 


Let  a'(t,x)  =  a(T-t,x) 

b'(t,x)  =  b(T-t,x,Y) 

(5.9)  b^ft.x)  =  b(T-t,x,Yn) 
c'(t,x)  =  c(T-t,x,Y) 
c^Ct.x)  =  c(T-t,x,Yn) 

Let  (0  :  (s,x)e  [0,T]x]Rd  }  (respectively  On  )  be  the  solution  to  the  mar- 

S  )  X  S  t  X 

tingale  problem  for  (a',b')  (respectively  (a'.b^))  (see  Stroock-Varadhan  [23], 

Chapter  6).  Q  are  measures  on  C([0,T],IRd)  such  that  for  any  geC* [0,T] •  ]Rd)  , 
s ,  x  b 

(5.10)  g(t,x(t))  -  J*(-^  +  L^)g(T,x(T))dx 


is  a  0^  ^  martingale,  where 

x(t)  =  x(t,n)  =  n(t) 

is  the  coordinate  process  on  C([0,T],Fd)  and 


ai8)  ' .  I .  J,  bS(t'x>w78 

X,j=l  J  1  J  1=1  1 

By  using  an  obvious  stopping  time  argument  and  (5.10),  it  can  be  shown  that 


f(t,x(t))  -  /*(^ :  +  l;)f(x,x(T))dT 

is  a  r>s  x  local  martingale.  Now,  using  integration  by  parts  formula  for  martin¬ 
gales  (Theorem  1.2.8  in  Stroock-Varadhan  [23])  and  a  stopping  time  argument,  it 
can  be  shown  that 

/*c'(u,x(u))du  a  J^c' (u,x(u))du 

e  f(t,x(t))-/J^  ♦  L>  *  c'(x,x(x))]f(x,x(x))e  dx 

is  a  ^-local  martingale,  tefs,T].  But  the  fact  that  ip  is  a  solution  to  (5.5) 

implies  that 


♦  I.;  ♦  c'(T,x(x))]f(x,x(x))  =  0 
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and  hence  t 

I  c'(u,x(u))du 
e  s  f(t ,  x(t) ) 

is  a  (Js  x  local  martingale.  But  f  and  c*  are  bounded  and  hence  it  is  a  martin¬ 
gale.  Equating  its  expectations  at  t=s  and  t=T  and  recalling  that  f(T,x)  = 
ii'0(x,Y)=<Kx)  we  get 

(5.11)  f(s,x)  =  Eq  (4>(x(T) )  exp(/^c '  (u,x(u))du)) 

Similarly,  we  have  for  n2l, 

(5.12)  f  (s,x)  =  E  (<f>(x(T) )  exp(/Ic '  (u,x(u))du) 

C 

Now  let  (s  ,x  )-+(s,x).  The  condition  (5.8)  on  b',b'  implies  that  Qn 

n  n  n  r  xs  ,x 

n  n 

converges  weakly  to  0  (see  Theorem  11.1.4,  Stoock-Varadhan  [231). 

T5  ,  X 

Denoting  the  integrands  in  (5.11),  (5.12)  by  G(s,n)  and  G^fs.n)  respectively 
(the  variable  n  is  suppressed  in  (5.11),  (5.12)),  condition  (5.8)  on  c',c^  implies 
that  G  -»G  uniformly  in  (s,n). 

Now, 

|f  (s  .x  )-f(s,x)|  <  |E  (G(s,n))-E  (G(s,n))| 

11  n  n  y  n 


+  |E  t,  (g  (S  ,n)-G(s,n))| 

0 

s„,x 
n  n 

<  |e  (G(s,n))-E  (G(s,n))| 

S’x  Qs  ,x 

n  n 

+  sup|Gn(sn,ri)  -G(s  ,n)  | 

n 


-*•0  as  n  +  °" 

as  0n  -+Q  weakly  and  G  -*  G  uniformly, 
s  « x  s  i  n  n 

n  n 

This  completes  the  proof  of  the  theorem. 

Remark:  It  may  be  observed  that  p^fx.y)  is  a  non-anticipat ive  functional  of 
y  because  the  same  is  true  for  ct(f,y),  as  pointed  out  in  Section  2. 
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